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P5 , Abstract. Analytic number theorists usually seek to show that sequences which appear 

^ ^ ' naturally in arithmetic are "well-distributed" in some appropriate sense. In various discrep- 

ancy problems, combinatorics researchers have analyzed limitations to equi-distribution, as 
have Fourier analysts when working with the "uncertainty principle" . In this article we find 
that these ideas have a natural setting in the analysis of distributions of sequences in analytic 
Uh ' number theory, formulating a general principle, and giving several examples. 

•z.' 

■3 

•^ ' 1. Introduction 

In this paper we investigate the hmitations to the equidistribution of interesting "arithmetic 
sequences" in arithmetic progressions and short intervals. Our discussions are motivated 
^ . by a general result of K.F. Roth [15] on irregularities of distribution, and a particular result 

of H. Maier [11] which imposes restrictions on the equidistribution of primes. 
O I If ^ is a subset of the integers in [1, x] with |^| = px then Roth proved that there exists 

Q ' N < X and an arithmetic progression a (mod q) with q < ^/x such that 

^' 

+H ' n^A, n<N ^ neA 

•^ ■ n=a (mod q) ri<N 

In other words, keeping away from sets of density or 1, there must be an arithmetic 
k> i progression in which the number of elements of ^ is a little different from the average. 

in I Following work of A. Sarkozy and J. Beck, J. Matousek and J. Spencer [12] showed that 

Roth's theorem is best possible, in that there is a set A containing ~ x/2 integers up to 
x, for which |#{n ^ A : n < N, n = a (mod q)} — i^{n G A: n < N}/q\ <^ x^^^ for all 
q and a with N < x. 

Roth's result concerns arbitrary sequences of integers, as considered in combinatorial 
number theory and harmonic analysis. We are more interested here in sets of integers that 
arise in arithmetic, such as the primes. In [11] H. Maier developed an ingenious method to 
show that for any A> 1 there are arbitrarily large x such that the interval (x, a;+ (logx)"^) 
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contains significantly more primes than usual (that is, > (1 + 5yi)(logx)"^~^ primes for 
some 6 A > 0) and also intervals (x,a: + (logx) ) containing significantly fewer primes than 
usual. Adapting his method J. Friedlander and A. Granville [3] showed that there are 
arithmetic progressions containing significantly more (and others with significantly fewer) 
primes than usual. A weak form of their result is that, for every A > 1 there exist large x 
and an arithmetic progression a (mod q) with (a, q) = 1 and q < x/(loga;)"^ such that 



(1.1) 7r{x;q,a 



TT 



(x) Tv{x) 



>A 



(f){q) (p{q) 



If we compare this to Roth's bound we note two differences: the discrepancy exhibited is 
much larger in (1.1) (being within a constant factor of the main term), but the modulus q 
is much closer to x (but not so close as to be trivial). 

Recently A. Balog and T. Wooley [1] proved that the sequence of integers that may 
be written as the sum of two squares also exhibits "Maier type" irregularities in some 
intervals {x,x + (logx)"^) for any fixed, positive A. While previously Maier's results on 
primes had seemed inextricably linked to the mysteries of the primes, Balog and Wooley's 
example suggests that such results should be part of a general phenomenon. Indeed, we 
will provide here a general framework for such results on irregularities of distribution, 
which will include, among other examples, the sequence of primes and the sequence of 
sums of two squares. Our results may be viewed as an "uncertainty principle" which 
establishes that most arithmetic sequences of interest are either not-so-well distributed in 
longish arithmetic progressions, or are not-so-well distributed in both short intervals and 
short arithmetic progressions. 

la. Examples. 

We now highlight this phenomenon! with several examples: For a given set of integers 
A, let ^(A^) denote the number of elements of A which are < A^, and A{N; q, a) denote 
those that are < N and = a (mod q). 

• We saw in Maier's theorem that the primes are not so well-distributed. We might ask 
whether there are subsets A of the primes up to x which are well-distributed? Fix u > 1. 
We show that for any x there exists y G {x/4, x) such that either 

(1.2a) \Ay)/y - A{x)/x\ -^u A{x)/x 

(meaning that the subset is poorly distributed in short intervals), or there exists some 
arithmetic progression a (mod i) with (a,£) = 1 and £ < x/(logx)", for which 



(1.2b) A{y;i,a 



A{y) A{x 



> 



u 



In other words, we find "Maier type" irregularities in the distribution of any subset of the 
primes. (If we had chosen A to be the primes = 5 (mod 7) then this is of no interest when 
we take a — 1,£ — 7. To avoid this minor technicality we can add "For a given finite set 
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of "bad primes" S, we can choose such an i for which (£, S) 
{£, S) = 1 means that (£, p) = I for all p E S. ) 



1". Here and henceforth 



• With probability 1 there are no "Maier type" irregularities in the distribution of 
randomly chosen subsets of the integers. Indeed such irregularities seem to depend on the 
subset having some arithmetic structure. So instead of taking subsets of all the integers, 
we need to take subsets of a set which already has some arithmetic structure. For example, 
define S^ to be the set of integers n having no prime factors in the interval [(log n)^~'^, log n] , 
so that S^{N) ~ (1 — e)A^. Notice that the primes are a subset of S^. Our results imply 
that any subset A of S^ is poorly distributed in that for any x there exists y G (a;/4, x) 
such that either (1.2a) holds, or there exists some arithmetic progression a (mod £) and 
£ < a;/(loga;)" with (a, £) = 1, for which a suitably modified (1.2b) holds (that is with (f){£) 
replaced by £llp\e, (iogx)i-<p<iogx(l " Vp))- 

• Let K be an algebraic number field with [K : Q] > 1. Let R denote the ring of 
integers of K and let C be an ideal class from the class group of R. Take A be the set 
of positive integers which are the norm of some (integral) ideal belonging to C. (In Balog 
and Wooley's example, A is the set of numbers of the form x'^ + y'^, with C the class of 
principal ideals in R = Z[i].) From our work it follows that the set A is poorly distributed 
in arithmetic progressions; that is, a suitably modified version of (1.2b) holds. Moreover, 
if we replace R by any order in K then either (1.2a) holds or a suitably modified version 
of (1.2b) holds (and we expect that, with some effort, one can prove that the suitably 
modified (1.2b) holds). 

• Let i3 be a given set of x integers and P be a given set of primes. Define iS(i3, V, z) 
to be the number of integers in B which do not have a prime factor p E V with p < z. 
Sieve theory is concerned with estimating S{B, V, z) under certain natural hypothesis for 
B,V and u := log a;/ log 2;. The fundamental lemma of sieve theory (see [7]) implies (for 
example when B is the set of integers in an interval) that 



S(B,V,z) 



- n 

pEV,p<z 



1 
1- - 

P 



< 



1 + 0(1) 

M log it 



. n 

pEV,p<z 



1 
1- - 

P 



for u < ^1/2+0(1). It is known that this result is essentially "best-possible" in that one can 
construct examples for which the bound is obtained (both as an upper and lower bound). 
However these bounds are obtained in quite special examples, and one might suspect that 
in many cases which one encounters, those bounds might be significantly sharpened. It 
turns out that these bounds cannot be improved for intervals B, when V contains at least 
a positive proportion of the primes: 

Corollary 1.1. Suppose that V is a given set of primes for which ^{p eV: p < y} ^ 

n{y) for all y E {^/z, z] . There exist constants c > such that for any u <^ -J~z there exist 
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intervals I± of length > z^ for which 

pEP,p<z 

and 5(7_,P,.)<{l-(^)"}|7_| H (^ " ; 

>. \ t= / J pev,p<z ^ ^ 

Moreover if u < (1 — o(l)) log log z/ log log log 2 then our intervals I± have length < 2""+^. 



• What about sieve questions in which the set of primes does not have positive lower 
density (in the set of primes)? If V contains too few primes then we should expect the 
sieve estimate to be very accurate, so we must insist on some lower bound: for instance 
that if q = Ylpev P then 

(1.3) ^_^^ > 60 log log log Q. 

p\<i 

(Note that '^p\„0-Ogp)/p < (1 + o(l)) loglogg, the bound being attained when q is the 
product of the primes up to some large y.) 

Corollary 1.2. Let q be a large square-free number, which satisfies (1.3), and define 
^ '■— (Wp\qP^^^Y^ f'^'^ ^ certain constant ci > . There exists a constant C2 > such that 
if Vz >u^ (log log g/ log 2)"^ then exist intervals I± of length at least z^ such that 

Y, l>{l + lK^n^|/+|, and Y. 1<{1-1K^"}^|/-|. 

nel+ nel- 

(n,q) = l {n,q) = l 



• The reduced residues (mod q) are expected to be distributed much like random 
numbers chosen with probability (p{q)/q- Indeed when (p{q)/q -^ this follows from work 
of C. Hooley [10]; and of H.L. Montgomery and R.C. Vaughan [13] who showed that 
#{n G [m,m + h) : (n, q) = 1} has Gaussian distribution with mean and variance equal 
to h4>{q)/q, as m varies over the integers, provided h is suitably large. This suggests that 
#{?! G [m,m + h) : (n, g) = 1} should be {1 + o(l)}(/i0(g)/g) provided h > log q, 
however by Corollary 1.2 this is not true for h = log q for any given A > 0, provided that 
J2p\qO-^8P)/p ^ loglogg (a condition satisfied by many highly composite q). 

In §6 we shall give further new examples of sequences to which our results apply. 
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lb. General results. 

Our main result (Theorem 3.1) is too technical to introduce at this stage. Instead we 
motivate our setup (postponing complete details to §2) and explain some consequences. 

Let A denote a sequence a{n) of non-negative real numbers. We are interested in 
determining whether the a{n) are well-distributed in short intervals and in arithmetic 
progressions, so let A{x) = J2n<x '^("') (^° if .4. is a set of positive integers then a{n) is its 
indicator function). Thinking oi A{x)/x as the average value of a(n), we may expect that 
if A is well-distributed in short intervals then 

(1.4) A{x + y)-A{x)^y^^, 

X 

for suitable y. 

To understand the distribution of A in arithmetic progressions, we begin with those 
n divisible by d. We will suppose that the proportion of A which is divisible by d is 
approximately h{d)/d where h{.) is a non-negative multiplicative function; in other words, 

(1.5) Ad{x):=J2^in)^^A{x), 

d\n 

for each d (or perhaps when (d, S) = 1, where 5 is a finite set of 'bad' primes). The reason 
for taking h{d) to be a multiplicative function is that for most sequences that appear in 
arithmetic one expects that the criterion of being divisible by an integer di should be 
"independent" of the criterion of being divisible by an integer d2 coprime to di. 

If the asymptotic behavior of A{x; q, a) for (g, 5) = 1 depends only on the g.c.d. of a 
and q then, by (1.5), we arrive at the prediction that, for {q,S) = 1, 

(1.6) A{x;q,a)^^^A{x), 

where 7g = np|n((P ~ 1)/(P ~ ^(p))) ^^^ fqi'^) is a certain non-negative multiplicative 
function of a for which fq{a) = fq{{a^ q)) (thus fq{a) is periodic (mod q)). In §2 we shall 
give an explicit description of fq in terms of h. 

In the spirit of Roth's theorem we ask how good is the approximation (1.6)? And, in 
the spirit of Maier's theorem we ask how good is the approximation (1.4)? 

Example 1. We take a{n) = 1 for all n. We may take iS = and h{n) = 1 for all n. 
Then fq{a) = 1 for all q and all a, and 7g = 1. Clearly both (1.6) and (1.4) are good 
approximations with an error of at most 1. 

Example 2. We take a{n) = 1 if n is prime and a{n) = otherwise. Then we may take 
5 = and h{n) = 1 if n = 1 and h{n) = if n > 1. Further fq{a) = 1 if (a, q) = I and 
fq{a) = otherwise, and 7g = (p{q)/q- The approximation (1.6) is then the prime number 
theorem for arithmetic progressions for small q < (logx)"^. Friedlander and Granville's 
result (1.1) sets limitations to (1.6), and Maier's result sets limitations to (1.4). 
Example 3. Take a{n) = 1 if n is the sum of two squares and a{n) = otherwise. Here 
we take S = {2}, and for odd prime powers p'' we have /i(p^) = 1 if p^ = 1 (mod 4) and 
h{p^) = 1/p otherwise. Balog and Wooley's result places restrictions on the validity of 
(1.4). 
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Corollary 1.3. Let A, S, h, fq and 7^ be as above. Let x be sufficiently large and in 
particular suppose that S C [1, log log x]. Suppose that < h{n) < 1 for all n. Suppose 
that 

1 — hiv) 

(1.7) y^ log p > a log log x, 

/-^ p 

p<\ogx 

for some a > 60 log log log x/ log log x and set rj = min(Q;/3, 1/100) . Then for each h/rf < 
u < ri{\ogxY''^ there exists y G {x/A,x) and an arithmetic progression a (mod t) with 
i < x/(logx)" and {i,S) = 1 such that 

A{y; e, a) - IMy^M. » exp ( - ^(1 + 257^) log{2u/r^')) ^. 

Remarks. Since the Corollary appears quite technical, some explanation is in order. 

• The condition < h{n) < 1 is not as restrictive as it might appear. We will show in 
Proposition 2.1 if there are many primes with h{p) > 1 then it is quite easy to construct 
large discrepancies for the sequence A. 

• The condition (1.7) ensures that h{p) is not always close to 1; this is essential in order 
to eliminate the very well behaved Example 1. 

• The conclusion of the Corollary may be weakly (but perhaps more transparently) 

written as 

fi{a) A{x) _ A{x) 



A{y;i,a) — y 



hi X 



>. 



• The lower bound given is a multiple of A{x)/(f){t), rather than of the main term 
{f£{a)/i.'-yi){iiA{x)/x). The main reason for this is that fi{a) may well be 0, in which case 
such a bound would have no content. In fact, since {y/x) < 1 and (^(£) < £7^, so the 
function used is larger and more meaningful than the main term itself. 

• It might appear more natural to compare A{y]l,a) with (/^(a)/£7^)^(y). In most 
examples that we consider the average A{x)/x "varies slowly" with x, so we expect little 
difference between A{y) and yA{x)/x (we have ~ l/logx in Example 2, and ~ C / ^J\ogx 
in Example 3 above). If there is a substantial difference between A{y) and yA{x)/x then 
this already indicates large scale fluctuations in the distribution of A. 

Corollary 1.3 gives a Roth- type result for general arithmetic sequences which do not 
look like the set of all natural numbers. We will deduce it in Section 2 from the stronger, 
but more technical. Theorem 2.4 below. Clearly Corollary 1.3 applies to the sequences of 
primes (with a = 1 + o(l)) and sums of two squares (with a = 1/2 + o(l)), two results 
already known. Surprisingly it applies also to any subset of the primes: 
Example 4. Let A be any subset of the primes. Then for any flxed u > 1 and sufficiently 
large x there exists £ < x/(logx)^ such that, for some y G {x/4,x) and some arithmetic 
progression a (mod £) with (a,£) = 1, we have 



A{y;e,a)- 



1 yA{x) A{x 



(i){i) X (Pii) 



>^ 
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This implies the first result of section la. A similar result holds for any subset of the 
numbers that are sums of two squares. 

Example 5. Let A be any subset of those integers < x having no prime factor in the 
interval [(log x)^"*^, logo;]. We can apply Corollary 1.3 since a > e + o(l), and then easily 
deduce the second result of section la. 

Our next result gives an "uncertainty principle" implying that we either have poor 
distribution in long arithmetic progressions, or in short intervals. 

Corollary 1.4. Let A, S, h, fq and 7^ be as above. Suppose that < h{n) < 1 
for all n. Suppose that (1.7) holds for some a > 60 log log log x/ log log x and set rj = 
min(Q;/3, 1/100). Then for each b/rf < u < ri{logx)'^''^ at least one of the following two 
assertions holds: 

(i) There exists an interval (f, v + y) C (x/4, x) with y > (logo;)" such that 

A{v + y)- A{v) - y^^ > exp f - -(1 + 25r]) log{2u/T]^))y^^^. 

X \ r] / X 

(ii) There exists y G (x/4, x) and an arithmetic progression a (mod q) with {q, S) = 1 
and q < exp(2(logx)^~'') such that 

A{y; q, a) - IMy^^ » exp ( - ^(1 + 257?) log{2u/v')) ^. 
qiq X \ r] J (p{q) 

Corollary 1.4 is our general version of Maier's result; it is a weak form of the more 
technical Theorem 2.5. Again condition (1.7) is invoked to keep away from Example 
1. Note that we are only able to conclude a dichotomy: either there is a large interval 
{VjV + y) C (x/4, x) with y > (logx)'" where the density of A is altered, or there is an 
arithmetic progression to a very small modulus {q < x^) where the distribution differs 
from the expected. This is unavoidable in general, and our "uncertainty principle" is aptly 
named, for we can construct sequences (see §6a. Example 6) which are well distributed 
in short intervals (and then by Corollary 1.4 such a sequence will exhibit fluctuations 
in arithmetic progressions). In Maier's original result the sequence was easily proved to 
be well-distributed in these long arithmetic progressions (and so exhibited fluctuations in 
short intervals, by Corollary 1.4). 

Our proofs develop Maier's "matrix method" of playing off arithmetic progressions 
against short intervals or other arithmetic progressions (see §2). In the earlier work on 
primes and sums of two squares, the problem then reduced to showing oscillations in certain 
sifting functions arising from the theory of the half dimensional (for sums of two squares) 
and linear (for primes) sieves. In our case the problem boils down to proving oscillations in 
the mean- value of the more general class of multiplicative functions satisfying < f{n) < 1 
for all n (see Theorem 3.1). Along with our general formalism, this forms the main new 
ingredient of our paper and is partly motivated by our previous work [6] on multiplicative 
functions and integral equations. In §7 we present a simple analogue of such oscillation 
results for a wide class of integral equations which has the flavor of a classical "uncertainty 
principle" from Fourier analysis. 

This broader framework has allowed us to improve the uniformity of the earlier result 
for primes, and to obtain perhaps best possible results in this context. 
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Theorem 1.5. Let x be large and suppose logo; < y < exp(/3 A/log a;/2v/log log x), for a 
certain absolute constant (3 > 0. Define 

A(x, y) = {i}{x + y) - ??(x) - y)/y, 

where 'd{x) = J2p<x ^^SP- There exist numbers x± in (x, 2x) such that 

A(x+, y) > y-'^(^'y) and A{x-,y) < -y-^('''y\ 

where 

^(x,.) = ^^(log(^i^) +loglog(-!^) +0(1)). 

log log X \ \ log log xJ \ log log xJ / 

These bounds are ::^ 1 if y = (loga;)'^*-^''. If y = exp((logx)^) for < r < 1/2 then these 
bounds are ^ ^-'^(1+0(1)). Thus we note that the asymptotic, suggested by probability 
considerations, 

^?(a; + y) - i}{x) = y + 0(y^+'), 

fails sometimes for y < exp((logx) 2"'^). A. Hildebrand and Maier [14] had previously 
shown such a result for y < exp((logx) 3""^) (more precisely they obtained a bound 3> 
y-(i+o(i))T/(i-T) jj^ ^]^g range < r < 1/3), and were able to obtain our result assuming 
the validity of the Generalized Riemann Hypothesis. We have also been able to extend 
the uniformity with which Friedlander and Granville's result (1.1) holds, obtaining results 
which previously Friedlander, Granville, Hildebrand and Maier [4] established conditionally 
on the Generalized Riemann Hypothesis. We will describe these in §5. 

This paper is structured as follows: In section 2 we describe the framework in more detail, 
and show how Maier's method reduces our problems to exhibiting oscillations in the mean- 
values of multiplicative functions. This is investigated in section 3 which contains the main 
new technical results of the paper. From these results we quickly obtain in §4 our main 
general results on irregularities of distribution. In §5 we study in detail irregularities in the 
distribution of primes. Our general framework allows us to substitute a zero-density result 
of P. X. Gallagher where previously the Generalized Riemann Hypothesis was required. In 
§6 we give more examples of sequences covered by our methods. Finally in §7 we discuss the 
analogy between integral equations and mean-values of multiplicative functions, showing 
that the oscillation theorems of §3 may be viewed as an "uncertainty principle" for solutions 
to integral equations. 

2. The framework 

Recall from the introduction that a{n) > and that A{x) = J2n<x '^(^)- Recall that S is 
a finite set of 'bad' primes, and that h denotes a non-negative multiplicative function that 
we shall think of as providing an approximation 

(2.1) Ad{x):=J2a{n)^^A{x), 

n<x 

d\n 
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for each {d,S) = 1. Roughly speaking, we think of h{d)/d as being the "probabihty" of 
being divisible by d. The condition that h is multiplicative means that the "event" of 
being divisible by di is independent of the "event" of being divisible by d2, for coprime 
integers di and d2- We may assume that h{p^) < p^ for all prime powers p^ without any 
significant loss of generality. As we shall see shortly we may also assume that h{p^) < 1 
without losing interesting examples. Let 



A{x]q,a)= 2_. a(n). 



n=a (mod q) 

We hypothesize that, for {q,S) = 1, the asymptotics of v4(x; g, a) depends only on the 
greatest common divisor of a and q. Our aim is to investigate the limitations of such a 
model. 

First let us describe what (2.1) and our hypothesis predict for the asymptotics of 
A{x;q,a). Writing {q,a) = to, since \{b (mod g) : {b,q) = m}\ = ip{q/m), from our 
hypothesis on A{x] q, a) depending only on (g, a) we would guess that 

A{x]q,a) ^ , , . V a{n)= V a(?i) V ^((/) = y2^{d)Adm{x). 

yq^n)^m ra\'n dl -^ 



Using now (2.1) we would guess that 

(2.2) M,; ,, a) . Ai.)^ j: .^MM .. IM^f,), 

d\ — 

' m 

where 

(-) ^.-n(ir^)"'-n(-i)(i-^-^-.. 

p\q P 

and fq{a) is a suitable multiplicative function with fq{a) = /^((a,^)) so that it is periodic 
with period g, which we now define. Evidently fq{p^) = 1 if p\ q. If p divides q, indeed if 
p^ is the highest power of p dividing q then 




Up') ■■- 

ifk>e. 

Note that if q is squarefree and h{p) < 1 then fq{p^) < 1 for all prime powers p'^. 

We are interested in understanding the limitations to the model (2.2). We begin with 
a simple observation that allows us to restrict attention to the case < h{n) < 1 for all n. 
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Proposition 2.1. Suppose that q < x is an integer for which h{q) > 6. Then either 

1 /,(0) 



A{x;q,0)-^^A{x] 



> 



2 qiq 



-A{x) 



or, for every prime i in the range x > £ > 3{x + q)/h{q) which does not divide q, there is 
an arithmetic progression b (mod £) such that 



A{x;e,b)-^A{x] 



> '-l^Aix) 



2 £7, 



e 



The first criterion is equivalent to |v4.q(x) — {h{q)/q)A{x)\ > ^{h{q)/q)A{x), since 

fq{0)/Qlq = fq{(l)/(llq = Kq) / Q- 



Proof. If the first option fails then 



1 /.(O) 



^ A{x]l,nq)> ^ a{nq)=A{x]q,Qi)>-^-^^^A{x 



i<x/(j 



I'^x/ q 



2 qiq 



h{q) 
2q 



A{x) 



On the other hand, if prime I \ q then fiijiq) = 1 if £ f n, and fiijiq) = h{l)'-y^ if l\n. 
Therefore for any A^, 






fijnq) 

he. 



Xi. hi ^ 4^, 



n<Af 



n<N 
l\n 



m 



^ {[N] - [N/e]) - ^-^hmN/e} < ^ + ^ 



Combining this (taking A^ = x/q) with the display above yields 

J: Aix-Xnq) > 'M^ix) > '-^Aix) > I J: f^^^'^ 



2q 



2qe 



A{x) 



n<x/q n<x/q 

which implies the Proposition with b = nq for some n < x/q. 

We typically apply this theorem with h{q) > log x for some large A. This is easily 
organized if, say, h{p) > 1 + 77 for > rjz/ log z primes p G {z/2,z) where z < logx, and 
letting q be the product of [rjz/ log z] of these primes so that q = e'^^^^^°^^'' and we can 
select any i in the range x > £ > x/ exp({r]'^/2)z/ log 2). 

Proposition 2.1 allows us to handle sequences for which h{p) is significantly larger than 
1 for many primes. Therefore we will, from now on, restrict ourselves to the case when 
< h{n) < 1 for all n. Suppose that {q, S) = 1 and define Ag = Aq{x) by 



(2.4) 



Ag(x) := max max 

x/4<.y<.x a (mod q) 



A{y]q,a) 



IMlAix) 

Qlq X 



A{x) 
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In view of (2.2) it seems more natural to consider \A{y; q, a) — fq{a)/{q'-yq)A{y)\ instead of 
(2.4) above. However (2.4) seems to be the most convenient way to formulate our results, 
and should be thought of as incorporating a hypothesis that A{y)/y is very close to A{x)/x 
when x/4 < y < x. Formally we say that A{x)/x is slowly varying: a typical case is when 
A{x)/x behaves like a power of logx, a feature seen in the motivating examples of A being 
the set of primes, or sums of two squares. With these preliminaries in place we can now 
formulate our main principle. 

Proposition 2.2. Let x be large and let A, S h, fq and Ag he as above. Let q < y/x < 
£ < x/4 be positive coprime integers with {q,S) = {£,S) = 1. Then 



<K^^'<"' + iK«)^'<"' + ""'>> 



1 



[x/2i] ^ 



7g 



1 



Proof. Let R := [x/{4q)] > ^/x/5 and 5" := [x/{2£)] < ^/x/2. We sum the values of a{n) 
as n varies over the integers in the following R x S "Maier matrix." 



{R + l)q + e {R+l)q + 2e 

{R + 2)q + i {R + 2)q + 2e 

{R + 3)q + i 

{R + 4)q + e : 



2Rq + i 



{R+l)q + Se 
{R + 2)q + Se 



(r, s)th entry : 

{R + r)q + si 



2Rq + Si 



We sum the values of a{n) in two ways: first row by row, and second column by column. 
Note that the n appearing in our "matrix" all lie between x/4 and x. 

The r-th row contributes A{{R + r)q + IS; £, {R + r)q) -A{{R + r)q; i, {R + r)q). Using 
(2.4), and noting that fei{R + r)q) = fi{R + r) as (£, q) = 1, this is 



ill 



X 



m' 



Summing this over all the rows we see that the sum of a^ with n ranging over the Maier 
matrix above equals 



(2.5a) 



f -4(x-) f 



h(r) 



+«(ss-^w^)- 



The contribution of column s is A{2Rq + is;q,£s) — A{Rq + Is^q^ls). By (2.4), and 
since fq{is) = fq{s) as (£, g) = 1, we see that this is 

f'^^'^'^U{x) + 0(^A{x] 



Qlq X 



<P{q) 
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Summing this over all the columns we see that the Maier matrix sum is 

(2.5b) ^A{x)J2^^+0(^,A{x)s). 

Comparing (2.5a) and (2.5b) we deduce that 

C Of? 

Write fe{r) = '^dlrd^i^) ^°^ ^ multiplicative function g£. Note that gi{p^) = if 
p \ I. We also check easily that \g(,{p^)\ < {p + l)/(p — 1) for primes p|£, and note that 
li = T.7=i9e{d)/d. Thus 

7^ Z Mr) - ^ E ..(.0(^0,:)) . i.o(l Y. ^^^ E l«'('OI 

'* r=_R+l '* d<2i? '*^ d>2i? '* d<2i? 

We see easily that the error terms above are bounded by 

«4_f;k£p«J,rr(i + o(4))«4. 

since t <x^ and -R ^ i/a;- We conclude that 

2K 



^ Y^ Mr) = l + 0{R-'^[ 



'^ r=R+l 

Combining this with (2.6) we obtain the Proposition. 

In Proposition 2.2 we compared the distribution of ^ in two arithmetic progressions. We 
may also compare the distribution of A in an arithmetic progression versus the distribution 
in short intervals. Define A(y) = A(y, x) by 



(2.7) A(y, x) := max A{v + y) — A{v) — y 

{v,v+y)c{x/4,x) 



A{x) I A{x) 



X 



y- 



X 



Proposition 2.3. Let x he large and let A, S, h, fq, Ag and A he as ahove. Let q < ^ 
with (g, 5) = 1 and let y < x/4 he positive integers. Then 



<P{q) 



A,(x)+A(x,2/)> 5^/,(s)-l 



^'^y s<y 



Proof. The argument is similar to the proof of Proposition 2.2, starting with an R x y 
"Maier matrix" (again R = [x/{4q)]) whose (r, s)-th entry is {R + r)q + s. We omit the 
details. 

We are finally ready to state our main general Theorems which will be proved in §4. 
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Theorem 2.4. Let x be large, and in particular suppose that S C [l,loglogx]. Let 
1/100 > T] > 20 log log log x/ log log a; and suppose that (logo;)'' < z < (logx)/3 is such 
that 

E ^^^^>r?log((l-r,)-^). 

z^—n<p<z 

Then for all h/rf < u < -J~z 

max A^ ^ exp (— w(l + 25?]) log(2ti/?7^)) . 

(,<x I z^ 
{£,S) = 1 

Note that J2z^-^<p<z ^/P ~ log((l ~ v)~^)- There is an analogous result for short 
intervals. 

Theorem 2.5. Let x be large, and in particular suppose that S C [1, log log x]. Let 
1/100 > rj > 20 log log log x/ log log X and suppose that (logx)'' < z < (logx)/3 is such 
that 

z^—n<p<z 

Then for each h/rf <u< -J~z at least one of the following statements is true: 

(i) For q < e^^ which is composed only of primes in [z^~'^,z\ (and so with {q,S) = 1) 
and such that J2p\ni^ ~ h{p))/p > rf , we have Ag ^ exp(— ^(1 + 25??) log(2w/?7^)). 

(a) There exists y > z"^ with A{y) ^ exp(— w(l + 25r]) log{2u /rj'^)). 

Deduction of Corollary 1.3. We see readily that there exists (logx)'' < z < (logx)/3 
satisfying the hypothesis of Theorem 2.4. Applying Theorem 2.4 (with u/rj there instead 
of u) we find that there exists £ < x/z'^''^ < x/(logx)" with (i^S) = 1 and A^ ^ 
exp (— («/?])( 1 + 25?]) log ( 2^/77^)). The corollary follows easily. 

Deduction of Corollary I.4. We may find (logx)'' < z < (logx)^"'' satisfying the hypoth- 
esis of Theorem 2.5. The corollary follows easily by applying Theorem 2.5 with u/r] there 
in place of u. 

3. Oscillations in mean- values of multiplicative functions 

3a. Large oscillations. 

Throughout this section we shall assume that z is large, and that q is an integer all of 
whose prime factors are < z. Let fq{n) be a multiplicative function with fq{p^) = 1 for 
all p \ q, and < fq{n) < 1 for all n. Note that fq{n) = fq{{n,q)) is periodic (mod q). 
Define 



F,M = E ^ = «»)G,(»). where G,(s) = l[{'~) (l+^+^^ 



n=l p\q 
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To start with Fg is defined in Re(s) > 1, but note that the above furnishes a meromorphic 
continuation to Re(s) > 0. Note also that 7^ = Gq{l) in the notation of §2. Define 

^(^)^=^ J](/,(n)-G,(l)), 
and put for all complex numbers ^ 
^.(0 := E i^AM//iog.^l^M^y for each , > 0, and J(0 := ^ ^p'^^ 

p\q p\q 

Theorem 3.1. With notations as above, we have for 1 < ^ < | log 2 

\E{u)\<eMHiO-^n + 5J{0). 
Let f log2 > ^ > 7r and suppose that H{^) > 20if2(O + 76J(0 + 20, so that 



loe 2 



r := ^{5H2{0 + l9J{C) + 5)/H{0 < 1/2. 
Then there exist points u± in the interval [H{^){1 — 2r), H{^){1 + 2r)] such that 

E{u+) > ^5^^ exp{H{0 - ^u+ - bH^iO - 5 J(0}, 

and 

E{u-) < -^5^^ exp{iy(0 - ^u- - 5H2{0 - 5 J(0}- 

In section 3b (Proposition 3.8) we will show that under certain special circumstances 
one can reduce length of the range for u± to 2. 
We now record some corollaries of Theorem 3.1. 

Corollary 3.2. Let z~~^ < t] < 1/100 and suppose that q is composed of primes in 
[z^~'^, z] and that 



E 



1 - Up) ^ .2 



> 



p 

p\q 



V 



Then for -J~z > u > h/rf' there exist points u± G [u, u{l + 22r])] such that 

E{u+)> exp(-u{l+25r)) log (-^)\ and E{u-) < -exp (-u{l+25r]) log (-^ 
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Proof. Note that for 1 < ^ < ^^ fog z 



I^IMm^ 



p\q 

and that 

H2{0 < v'HiO. and J{0 < e^h^-^ < ii^H{i), 

where the last inequaUty for J(^) is easily checked using our lower bound for H{^) and 
keeping in mind that z~~^ < ?7 < 1/100 and that ^ < ^\ogz. From these estimates it 
follows that ii H{^) > b/rf then r (in Theorem 3.1) is < brj. Therefore from Theorem 
3.1 we conclude that if H{^) > h/rf and tt < ^ < ^\ogz then there exist points u± in 
[H{i){l - 10?7), H{i){l + 10?])] such that 

E{u+) > ^^^1^ exp(if (0 - ^u+ - 5H2{0 - 5J(0) > £-«"+, 

and E{u-) < — e~^"". Renaming H{^){1 — 10?]) = w so that ^ < j^ \og{2u/Tf') we easily 
obtain the Corollary. 

Corollary 3.3. Suppose that q is divisible only by primes between y/z and z. Further 
suppose c is a positive constant such that for 1 < ^ < | log 2; we have -ff(^) > ce^/C- Then 
there is a positive constant A (depending only on c) such that for all e^ < u <ti cz^l'^ j \ogz, 
the interval [^(1 — A/logw), w(l + A/logw)] contains points u± satisfying 

E{u+) >exp{-w+(logw+ +loglogw+ + 0(l))}, 

and 

E{u-) < -exp{--u+(log-u+ + loglog-u+ + 0(1))}. 

The implied constants above depend only on c. Note that X]v^<p<2 l/p^~^/'°^^ x e^/^, 
by the prime number theorem. Thus H[^) <^ e^/^, and the criterion H[^) > ce^/^ in 
Corollary 3.3 may be loosely interpreted as saying that, "typically", 1 — fq{p) ^ c. 

If iir(^) ~ Ke^/C then our bounds take the shape exp{— w(log(ti/K)+loglogw — H-o(l))}. 

Proof of Corollary 3.3. In this situation J(0 < Evi<p<zP^^^^°^ Vp^ < e^/^ + e^^/z < 
e^/^"^. Further using the prime number theorem 

-^ p y logz J jr-,tiogt\ logz J e 

The corollary now follows from Theorem 3.1, and renaming u = H{^) so that ^ = logti + 
log log tt + 0(1). 
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Corollary 3.4. Keep notations as in Theorem 3.1, and suppose q is divisible only by the 
primes between z/2 and z. Further suppose that c is a positive constant such that for 1 < 
^ < I logz we have H{^) > ce^ /logz. Then there is a positive constant A (depending only 
on c) such that for all e"^ < u <ti cz^l'^ j \o<gz the interval [w(l — Aj log-u), w(l + Aj logw)] 
contains points u± satisfying 

E{u+) > - — exp{-u+{logu+ + loglogz + 0{1))}, 

log log z 

and 

Eiu-) < —- — exp{— -u-flog-u- + loglog^ + 0(1))}. 

log log z 

As in Corollary 3.3, the implied constants above depend only on c. Also note that H{^) 
in this case is always < ^z/2<p<z l/p^~^'^^^^ x e^ /\ogz. 

Proof of Corollary 3.4- In this case J(^) <^ ^^^ Z]p>2/2 l/^^ ^ e'^^zlogz ^ e^log^ z. 
Further note that H2iC) < {(^^ /^og^ z)J2z/2<p<z'^/P < e^/log^2. Taking u = H{^) so 
that ^ = log « + log logz + 0(1) and thus H{^) <^ e^/logz, we easily deduce Corollary 3.4 
from Theorem 3.1. 

Corollary 3.5. Keep the notations of Theorem 3.1, and suppose (as in Corollary 3.3) 
that q is divisible only by primes between ^/z and z and that for 1 < ^ < | logz; we have 
H{$,) > ce^/C- Let y = z^ with 1 < u <^ cz^l"^ jXogz. There is a positive constant B 
(depending only on c) such that the interval [1^ 2'"(i+'^/i°g("+i))+^] contains numbers v± 
satisfying 

- y, U,{n) - G,(l)) > exp{-ti(log(ti + 1) + log \og{u + 2) + 0(1))}, 

y 



and 



- Y. (^'?(^) - G'g(l)) < - exp{-ti(log(tx + 1) + log \og{u + 2) + 0(1))}. 



u„ <n<D_+j/ 

Proof. Appealing to Corollary 3.3 we see that there is some w — z'^'^ with ui E [e^ +u{l + 
D/ log{u + 1)), e^ + u{l + {D + 3A)/ log{u + 1))] (here A is as in Corollary 3.3 and D is a 
suitably large positive constant) such that 

^ifqi'^) -Gq{l)) > tyexp(-wi(logwi + loglog'Ui + Ci))j 

n<.w 

for some absolute constant Ci. 
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We now divide the interval [l,w] into subintervals of the form {w — ky,w — {k — l)y] 
for integers I < k < [w/y], together with one last interval [1, yo] where yo = w — [w/y]y = 
y{w/y}. Put yo = z^" . Then, using the first part of Theorem 3.1 to bound |£'(tio)| (taking 
there ^ = log(t(.o + 1) + loglog(tto + 2)), we get that 

I 5^ (/,(n) - ^,(1))! = yo|^(^^o)| < yo eM-ino + l)(log(txo + 1) + loglog(t/o + 2) - C2)) 
n<yo 

for some absolute constant C2. 

From the last two displayed equations we conclude that if D is large enough (in terms 
of Ci and C2) then 

J2 {fq{n)-G,{l))>weM-u{log{u + l)+\og\og{u + 2) + 0{l))} 

yo<n<w 

so that the lower bound in the Corollary follows with v+ = w — ky for some 1 < k < [w/y]. 
The proof of the upper bound in the Corollary is similar. 

We now embark on the proof of Theorem 3.1. We will write below fq{n) = J2d\n9qi^) 
for a multiplicative function Qq, the coefficients of the Dirichlet series Gq{s). Note that 
9q{p'') = for p \ q, and if p\q then gq{p'') = fq{p^) - fq{p^~^). Clearly ^^(1) = 
'Yl^=i9q{d)/d. Let [f] and {t} denote respectively the integer and fractional part of t. 
Then 

1 .r^ '(T^ ... f^"l„ ,.. 1 °° 



(3-1) E{u) =-y: e^.(^) - ^-^Gqii) =-j:9q{d){ 



n<z" d\n d=l 



d 



We begin by establishing the upper bound for |-E'(t(.)| in Theorem 3.1. 
Proposition 3.6. In the range 1 < ^ < | log 2 we have 

\E{u)\<expiH{C)-Cu + 5JiC) 
and also 



poo o 

/ e^"|E(w)|rfw< -exp(if(0 + 5J(0)• 



As will be evident from the proof the condition ^ < § log z may be replaced by ^ < 
(1 — e) log 2. The constants 3 and 5 will have to be replaced with appropriate constants 
depending only on e. 

Proof of Proposition 3.6. Since |[2"/(i] — [z"]/(i| < min(2'"/(i, 1) we obtain, from (3.1), that 



\Elu]\ < V ^ii^+ V '^'^^^' <c-g" V l^9l"Jl ^g/loK.. 

'^^'"^^2" "^ d ~ ^ d 

d<z^ d>z" d=l 
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and also that 



e^-\E{u)\du<Y,\9M)\ 



d=i 



log d/ log z g^ 



3?" 



du + ~~ir^^ 

^ J\ogd/ \ogz ^ 



< 



11°° 



i logz-^/^ d 



\9,{d)\ 



d^/ 



log 2 



Now, as each |^(p^)| < 1 and as 2^/^/ {2^/^ - 1) < 5, 

- ii V ^ 5nl-^/logZ ^ Z^ ^fc(l-C 



d=l 



rf 



p|g 



p. 



k=2 



p. 



^/logz) 



< 



ii V ^pl-^/logz^V p2(l 



p2(l-C/Iog2) 



since p > 2 and ^ < § log 2. The Proposition follows upon taking logarithms. 



Define 



/•CXD 

I{s) = / e-'''E{u)du. 
Jo 



From Proposition 3.6 it is clear that I{s) converges absolutely in Re(s) > — |log2, and 
thus defines an analytic function in this region. Further if Re(s) > then 

I{s) = I > ■ (Lin) - GJD) = > '(Lin) - GJD) I '—^du 







z"- 



(3.2) 



- E (AH - ^^(1)) = E(AH - ^^(1)) , 

n<z" n=l J\ogn/\ogz 

C(l + s/log^) 



log z + s 



■{G,{l + s/\ogz)-G,{l)). 



By analytic continuation this identity continues to hold for all Re(s) > — § log 2. 
Proposition 3.7. For 1 < ^ < flog^ with z sufficiently large, we have 

p e^^\E{u)\du > ^^l-^(exp {H{0 - 5if2(6 " 5J(0} " l)- 

Proof. Taking s = — (^ + iir) in (3.2) we deduce that, since |Gg(l)| < 1, 

|C(l + s/log^) 



/•CXD 

/ e^''\E{u)\du>\I{s)\> 
Jo 



\s + logz\ 



(1^,(1 + s/log^)|-l 



From the formula C(f^) = 'w/{w — 1) — w f-^ {x}x ^ "^dx, which is valid for all Re(zi;) > 0, 
we glean that 



|C(l + g/log^) 
|s + log^l 



> 



Re 



-1 



1 



i^+iTT) logZ Ji 



{x}x' 



-2+^/log.,,J!;l^W, 



log 2; 
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For large z and ^ < | log^ we see easily that the integral above is positive^, and so we 
deduce that 

|C(1 + si logz)|/|s + logzl > Re(l/(e + ztt)) = il^e + ^')- 



Next we give a lower bound for 1^5(1 + s/log2)|. We claim that for z > 101^ and for 



all primes p 

(3.3) 



. , fqjP) fq{P 



pl+s/logz p2{l+s/logz) 



+ .. 



> 



^ ^ fqiP) 



P 



1+s/logz 



103 



1 



100»2-2C/log2: 



When p > 101"^ we simply use that the left side of (3.3) exceeds |1 + fq{p)/p^~^^^^"^^\ — 
Y!k=2^/P^^^~^'^''^^^ and the claim follows. For smaU p < 101^, set K = [logz/(2 logp)] 
and observe that for k < K the numbers f^(^p^^/p'^(^+^/^'^sz) ^n i^ave argument in the 
range [0, 7r/2]. Hence the left side of (3.3) exceeds, writing q = l/pi~€/iog2^ 



1 + 



Up) 



l+s/logz 



p 



2-^ .,k 



> 



k>K 



1 + 



Up) 



l+s/logz 



p 



q^-^{q-iy 



which implies (3.3) for z > 101^. 
Observe that if \w\ < 2~^'^ then 



log |1 + wl = Re {w - ^(-l)"«;"/?i) > Re {w) - 5\w\'^/4. 



n=2 



From this observation and our claim (3.3) we deduce easily that 



log 



1 



P 



l + s/log2 



1+ /^y +... 



p 



l+s/logz 



> Re 



Up) - 1 



pl + s/log2 / p2(l-5/log2) ■ 



It follows that 



log |G,(1 + s/logz)| > -Re J] ( ^-^^^ ) p-^/'°g^ - 5J(0 



v\q 



^- fq{p) \ ^-s/logz 
P 



m)^E(^)p 

p\q 



^^/i«^^f-l-cosf^^^^))-5J(0. 
log 2 



Since —1 — cos(7rlogp/ log^) > — (7r^/2)(log(2/p)/ logz)^, we deduce the Proposition. 
We are now ready to prove Theorem 3.1. 



iln fact, for 2 > 200. 
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Proof of Theorem 3.1. The first part of tfie result was proved in Proposition 3.6. Now, let 
I^ (and /~) denote tfie set of values u with E{u) > (respectively E{u) < 0). Taking 
s = — ^ in (3.2) we deduce that for ^ < | log^ 



e^''E{u)d 



u 



< 2 



ai-^/\ogz) 



\ogz-i 



6 



since < Gq{l—^/\ogz),Gq{l) < 1, and since {usmg C,{w) / w = l/{w—l)—J^{x}x ^ '^dw) 



ai-inogz) 



{\ogz-i) 



1 1 1"°^ 

i logzj. 



1 

< T + 



< 



^ log^-^ C 



Combining this with Proposition 3.7 we deduce easily that 



(3.4) 



1 



■(exp{if(0-5if2(e)-5J(0}-l 



3 



>-exp{if(0-5if2(0-5J(0}- 



Put ui = H{^){1 + 2t). Then by Proposition 3.6 we get that 

/'OO /'OO o 

/ e^"|E(w)|rfw<e-^"i / e^^+^^''\E{u)\du<-exp{H{^ + T)-TUi + 5J{^ + r)}. 

Jui Jo ^ 

Now H{C + r) < e^H{i) < (1 + r + r^)H{C), and J(^ + r) < e^^ J(^) < 2.8J(^). Hence 
H{C + t) + 5J{C + r) - TUi < H{^) - 5H2H) - 5J(^) - 5, and so we conclude that 



(3.5) 



/ e^^\E{u)\du < exp{if(0 - 5if2(0 " 5J(0}- 



Similarly note that, with uq = H{^){1 — 2r), 



pUQ poo 

/ e^''\E{u)\du < e^""" e^^-^^''\E{u)\du < 

Jo Jo 

Now J(^-r) < J(0, and 



C-r 



exp{if (^ - r) + TWO + 5J(^ - r)}. 



p|g 



^ fl^P^ pU log z^-r/ log 2 ^ y^ ^ fl(P) pU log ^ ("l 

p ~ -^^ p V logz ' 2 



logp r^ 



p\q 



since -ffi(^) < ^/H{^)H2{^) by Cauchy's inequality. From these observations and our 
definition of r it follows that H{C - r) + 5J(^ - r) + r-uo < ^(6 - 5ii"2(0 - 5J(6 - 5, 
and so 

e^-\E{u)\du < — exp{if (0 - 5^^2(0 " 5J(0}- 

Combining (3.4), (3.5), and (3.6) we deduce that 



/. 



1 



e^''\E{u)\du > — exp{if (0 - 5^^2(0 - 5J(0}- 
Now ui-uq< 4tH{^) < 2H{^), so the Theorem follows. 
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3b. Localization of sign changes of E. 

We saw in Corollary 3.3 that (in typical situations) E changes sign in intervals of the 
form [u{l — Aj log w), w(l + Aj logw)]. We consider now the problem of providing a better 
localization of the sign changes of E for small values of u. Our main result of this section 
is the following: 

Proposition 3.8. With notation as above, suppose that max2;>^t |-E'(a;)| ^ l/{Gq{l) log 2)) 
for some u > 6. Then there exist points u^^U- G [-u — 1,^ + 1] such that E{u^), —E{u_) > 
max2;>„ \E{x)\. 

Proposition 3.8 (which may be easily deduced from the Lemmas of this section) can be 
used to reduce the size of the interval in Theorem 3.1. In Corollary 3.3 this is simple to 
state: For e"^ < u < log log 2/(2 log log log 2) the interval [w — 1, w + 1] contains points u± 
satisfying the conclusions of Corollary 3.3. 

Lemma 3.9. Uniformly for u > we have 

uEiu)+ rEit)dt+-^T'-^MPliogpE(u-'^)=o(--^ 
Ju ^OS-^^ P ^ ^OS-^^ VG'g(l)log^ 

Proof. Let Ei{u) :— J^dyz-^ dq{d)/d] and note that |i?i(w)| < Yla \9qi.d)\/d < 1/Gq{l). By 
a result of R.R. Hall (see [13], or (4.1) of [10]) we see that 



^EW'')i«^E^« 



2"" ^^-^ ulogz'^-^ d Ga{^)u\ogz 

Therefore, from (3.1) we deduce that 

(3.7) Eiu) = -il + 0{z-n)E^{u)+0{^ Yl l^Ml) = ' ^^^''^ + ^ {-—^^-—- 

d<z^ '^^ ' ^ 

Manipulation of Ei{u) yields our identity. The starting point is the observation that 

(3.8) uEM+rE,(f),t = uEM+ E ^(!^-«) = E ^!^- 

/„ -^^ d Vlog^ / -^^ d log 2 

•^^ d>z^ ^ d>z^ *= 

We approximate the left side of (3.8) as follows: 

{uEi{u)+ Ei{t)dt) + {uE{u) + E{t)dt) 

J u J u 

<u\Ei{u)+E{u)\+ I \Ei{t) + E{t)\dt 



J u 



1 f^ /I \ 

Gq{l)\ogz J^ VG'g(l) j^„ / 

G,(l)log^^log^V4- d J^Gq{l)logz 
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Now \ogd — X]m|d^("^) ^° ^^^^ ^^^ right side of (3.8) equals 

J-yA(m)y ^. 

log 2 ^^ ^^ li 

Tn\d 

The sum over to's above can be restricted to prime powers p'^ ior p < z (else gq{d) — 0). 
Further the contribution of prime powers p^ with k >2 is bounded by <^ l/(Gg(l) log 2). 
Finally for m = p < z we see that 

v^ 9q{d) _ gqjp) y- gq{d) / 1 A 1 - fq{p) p / ^ logP^ I ^( 1 

f-„ rf p ^^^^ d ^ yp^Gqii)) p '\ logzr Vg,(i) 

m\d 

Therefore, by (3.7), this, taken with the estimate for the left side of (3.8), yields the result. 

We call a point w special if |-E'(ty)| = maxa;>^„ |E'(x)|. Since E{x) -^ as a; -^ 00 we see 
that there are arbitrarily large special points. 

Lemma 3.10. Given u > 2 either E{x) = 0{l/{Gq{l) logz)) for all x > u or there is a 
special point in [u, u+ 1]. 

Proof. Let w denote the infimum of the set of special points at least as large as -u, and 
assume w > u + 1 (that is, there is no special point in [u,u + 1]). Note that |-E'(ty)| > 
|i?(x)| + 0{z~^) for any x > u. If E{x) maintains the same sign for all x > w set v = 00; 
otherwise let v denote the infimum of those points x > w for which E{x) has the opposite 
sign to E{w). Note that E{v) = 0{l/z'"). Taking Lemma 3.9 with u — w and u — v and 
subtracting we find that 

wE[w) + / E[t)dt H > logp \E\w — — E\v 

J^ iogz ^ p V V logz/ V logz 

p<2 

(3.9) =0' "^ 



G',(l)log2. 
Since -E'(t) maintains the same sign throughout \w.,v\ we have that 

wE{w)+ E{t)dt >w\E{w)\, 

Jw 

while on the other hand 

> ^ logp [E[w- -E[v- 

M z ^—^ V V V loe zJ \ ioe z 

2 Y^ logp 



< 



E^^.^^^ 1^(^)1 ^ (2 + o(l))max|£;(OI < (2 + o(l)) ]£;(«;) | 

p<z 

since w was assumed to be larger than -u + 1. We conclude that (w — 1 + o{l))\E{w) 
0{l/{Gq{l) log 2)) which establishes the Lemma. 
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Lemma 3.11. If u > 2 and E{x) maintains the same sign throughout [u,u + 2] then 
E{x) = 0(l/(Gg(l) log^)) for aUx>u + l. 

Proof. Suppose not. Let w denote the infimum of the set of all special points at least as 
large as u + 1. By Lemma 3.10 we know that w < u + 2. Let v denote the infimum of 
points X > u + 2 such that E{x) has the opposite sign to E{w); if no such point exists set 
V = oo. Now E maintains the same sign in [w — 1, v] (since it is a subinterval of [u, v]) and 
so 

wE{w)+ r E{t)dt+-^Y^—^^logpE(w-^^) >w\E{w)\; 
Ju, logz^ p ^"^ V log^y - I V yi 

on the other hand 

log z ^^-^ p \ logzJ 

since \E{v — logp/log^)! < max2;>„_i |-E'(x)| < \E{w)\. Therefore, by (3.9), we deduce 
that (ty — 1 + o{l))\E(w)\ < 0{l/{Gq{l) logz)) which proves the Lemma. 

Proposition 3.12. Fix e > 0. Given a special point w > 4 + e either there exists a point 
$, E [w — l,w] for which E{^) and E{w) have opposite signs and |-E'(^)| > {w — A — e)\E{w)\, 
or E{w) = 0{l/{Gq{l)\ogz)). 

Proof. Select v G [tw + l, ty+3] with E{v) = 0{z~'") {ifv does not exist then the Proposition 
follows from Lemma 3.11). Choose 5 = ±1 so that SE{w) > 0. Suppose that dE{x) > 
— {2w — V — 1 — e)6E{w) for all x G [w — l,w\. Then 6 times the right side of (3.9) is 

> 5Eiw){w -I Idt- -^ y ^ {2W-V-1 -€ + !))> e\E{w)\/2 

since each v — \ogp/logz > w so that \E(v — logp/log^)! < \E{w)\. The result follows 
from (3.9) since 2w — v — l>w — 4. 

Given u > 3 + e we can take zw to be a special point in [-u + l, w + 2] and then ^ G [w, ^ + 2]. 

4. Proof of Theorems 2.4 and 2.5 

We shall only provide the proof of Theorem 2.4, the proof of Theorem 2.5 being entirely 
similar. We take q to be the product of all the primes in {z^~'^, z) so that q < Y[p<zP — 
^i/3+o(i)_ gince S C [l,logloga;] we also know that (q^S) = 1. Since q is squarefree 
we check that fq{n) (as defined in Section 2) satisfies fq{j>^) = 1 \i p \ q and fq{p^) = 
h{p){p — l)/{p — h{j>)) ii p\q and k >1 (note that fq{j>^) < h{p) < 1 in this case) . Thus 
< fq{n) < 1 for all n, and we may apply the results of Section 3, in particular Corollary 
3.2. From Corollary 3.2 we obtain that for y/z > u> b/rf there exists A G [w, ^(1 + 22?7)] 
such that 

(4.1) |i^ ^ /^(n)-G,(l)|>exp(-^l + 25r7)log(^ 
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We now use Proposition 2.2. We may easily find i. G {x/{2{z'^ + l))^x/{2z'^)) such that i. 
is not divisible by any prime below (loga;)/3. Such an i. is coprime to g, satisfies (£, S) = 1, 
and further (/){£)/£ ^ 1. Proposition 2.2 therefore yields, by (4.1), 

Aq + A^ > exp ( - w(l + 25?7) log (^)) , 

proving Theorem 2.4 (in the statement of which we might take i = q). 

5. Limitations to the equidistribution of primes 

In this section we will exploit the fiexibility afforded by our general oscillation results to 
obtain refinements to previous results on the limitations to the equidistribution of primes. 
We stated in Theorem 1.5 (see Introduction) the result for primes in short intervals and 
we now state the analogous results for primes in arithmetic progressions. 

Theorem 5.1. Let £ be large and suppose that £ has fewer than (log^)"'^"'^ prime divisors 
below log£. Suppose that (log^)^"'"'^ < ?/ < exp(/?v^log £/y/2 log log £) for a certain absolute 
constant (3 > 0, and put x = £y. Define for integers a coprime to £ 

A{x]£,a) = li}{x;£,a) - ^ 



ip{£)J / ip{£) 

There exist numbers x± in the interval (x, xy^/^°s*^^°s^/^°s^°s^^), and integers a± coprime 
to £ such that 

A{x+] £, a+) > y-^(^'y\ and A{x-;£, a_) < -y-^(^'y\ 

Here D is an absolute positive constant which depends only on e, and 5(-, ■) is as in Theorem 
1.5. 

These bounds are > 1 if y = (log£)^(^), and > y-^(i+«(i)) \i y = exp((log£)^) with 
< r < 1/2. The corresponding result in [4], Theorem Al, gives the weaker bound 
y-{i+o(i))T / {1-t) (^^]2ough our bound is obtained there assuming the Generalized Riemann 
Hypothesis). Our constraint on the small primes dividing £ is less restrictive than the 
corresponding condition there, though our localization of the x± values is worse (in [4] the 
x± values are localized in intervals {x/2,2x)). 

Theorem 5.1 omits a thin set of moduli £ having very many small prime factors. We 
next give a weaker variant which includes all moduli £. 

Theorem 5.2. Let £ be large and suppose that (log£)^+'^ < y < exp(/3 Vlog £/ Vlog log £) 
for a certain absolute constant (3 > 0, and put x = y£. There exist numbers x± in the 
interval (x, xy'^/^°^*^^°^^/^°^^°^^^), and integers a± coprime to £ such that 

y-Si{e,y) y-5i{£,y) 

A(x+; £, a+) > - — - — and A(x_;£,a_)< 



log log log £ log log log £ 



AN UNCERTAINTY PRINCIPLE FOR ARITHMETIC SEQUENCES 25 

where di{x,y) = (loglogy + 0(l))/(loglogx). Here D is an absolute positive constant 
which depends only on e. 

Theorem 5.2 should be compared with Theorem A2 of [4]. Our bound is ^ y-^(i+o(i)) 
if y = exp((log£)^) with < r < 1/2. The corresponding result in [4], Theorem A2, gives 
a weaker bound y~(^+'^W)'^/(^+'^\ though our localization of the x± values is again much 
worse. 

To prove Theorems 1.5, 5.1 and 5.2 we require knowledge of the distribution of primes 
in certain arithmetic progressions. We begin by describing such a result, which will be 
deduced as a consequence of a theorem of Gallagher [5] . 

For I < j < J := [log 2/(2 log 2)] and consider the dyadic intervals Ij = {z/2^ , z/2^~^]. 
Let Pj denote a subset of the primes in /_,• , and let tvj denote the cardinality of Pj . We let 
Q denote the set of integers q with the following property: q = Y[i=i Ij ^^'^ each qj is the 
product of exactly [7rj/2] distinct primes in Pj. It is clear that all the elements of Q are 
squarefree and lie below Q := 2;^^^^'^, and that \Q\ = Y[j=i (rw2i)- 

There is a constant ci such that at most one primitive L-function with modulus q 
between vT and T has a zero in the region a > 1 — ci/logg, and |t| < T. Further if 
this exceptional Siegel zero exists then it is real, simple and unique (see chapter 14 in [2]). 
We call the modulus of such an exceptional character a Siegel modulus. Below Q there 
are <^ log log Q Siegel moduli. Denote these by i/i, z/2, . . . , i^i, and for each select a prime 
divisor vi, . . . ,ve. Assume none oivi,...ve belongs to Uj^iPj, which guarantees that there 
are no Siegel zeros for any of the moduli d where d is a, divisor of some q & Q. 



Proposition 5.3. Suppose that exp(vlogx) < Q < x where b is a positive absolute 
constant, and let a;exp(— y^Ioglr) < h < x. Then 






, ^, , max 

\Q\ ^^{a,q) = l 
qEQ 



'd{x + h; q, a) — 'd{x; q, a) — h/Lp{q) 



h/ip{q) 



yiogx 
<^ exp — a 

Vlogz 



where a is a positive absolute constant. 

Proof. If (a, q) = 1 then using the orthogonality of characters 

^ x-\-h 

i}{x + h;q,a)-i}{x;q,a) = —— J^ x{a)^x{p)^ogp 

^^^^ X (mod q) 



x-\-h 



^^^^ x<p<x+h ^^^^ X/XO ^ 

(P,<?) = 1 



By the prime number theorem the first term above is h/(p{q){l+0{exp{—c\/logx))}, which 
has an acceptable error term. We now focus on estimating the second term on average. 
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Below the superscript * will indicate a restriction to primitive characters. Observe that 

x-\-h x-\-h 



X/Xo a; 



d\q X (mod d) x 

d>l 

x-\-h 



PI I 



= ^ Y^ \ Y X{p) ^og p +0{d{q) log q). 



d\q X (mod d) x 
d>l 



Thus 



x-\-h 



q^Qx^XO a; 



P 



x+h 



E (El) E* |E^(p)i°g^|+^(i2i^')- 



l<d<Q qeQ X (mod d) 
d\q 



Observe that if d is to have any multiples in Q then we must have d = 117=1 ^j with 



each of the dj being composed of at most [7rj/2] distinct primes from Pj. Therefore 

7Tj-u{dj) \ ^ YJ r,-u(d,) I "i \_r,-^id)\ 






i=i 



log ^ , 



where uj{n) denotes the number of prime factors of n, and the final estimate follows since 
all prime divisors of d are below z and so uj{d) > logd/ logz. From these remarks we see 
that the right side of (5.1) is 



(5.2) 



x-\-h 



<iQiE 2"^ E |E^(^)^°s^ +^(i2 

l<d<Q X (mod d) x 



X 



where the b on the sum over d indicates that the sum is restricted to d as above; note that 
such d have no Siegel zeros. 

Define Jq := [l,exp(Vlogx)], and Jk '■= (exp(v^loga;)2^~"'^,exp(Vlogx)2'^] for 1 < /c < 
[(logQ — y/Togx)/ log 2]. Theorem 7 of Gallagher [5] implies that the contribution of terms 
d E Jk (for /c > 1) is (for a positive absolute constant a) 



< |Q|/iexp i — a^ 



logx 



Vlog X + {k + l)logz 



Vlogx + k log z 
2 log 2 



^ h\Q\ex.p i— a 



y/2Togx 
V^ogz 



and also that the contribution of the terms (i G Jo is ^ |Q|/iexp(— ctv^loga;). Summing 
over k we deduce that the quantity in (5.2) is ^ | Q|/i exp(— ctv^log x/ Vlog z) which proves 
the Proposition. 
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Corollary 5.4. Suppose that exp(v^logx) < Q < x^ where h is a positive absolute con- 
stant. Select z to he the largest integer such that Y\p<z P — Q'^~°^^^ ■ Then there exists an in- 
teger q e [Q^-'^/^'^s^ogQ^ Qj^ whose prime factors all lie in [y^, z] with J2p\q l/p^~^^^°^^ > 
e^/3^ /o?^ 1 < ^ < I log 2, such that 



(5.3) 



'd{2x; q, a) — 'd{x; q, a) — 



X 



< 



X 



^{q) ^{q) 



exp 



/3 



Vlogx 
Vlog log X 



for all (a, g) = 1, where (3 is a positive absolute constant. 

Proof. When z is chosen as above we have z ~ 21og(5 so that v^Iogx <^ ^ <^ log a;. If 
z/2^ < y^then let Pj =0. If ^ > z/2^ > y^then let Pj be the set of primes in Ij, omitting 
the prime divisors vi, . . .V£ of Siegel moduli: in these cases ttj = 7t{z/2^~^) — 7t{z/2^) + 
0(logloga:). By Proposition 5.3 we can select q satisfying (5.3), with ~ z/(2-^+^ log(2/2'^)) 
prime factors in each Ij where j < [log 2;/ log 4] , so that '^„\n^/p^~^ °^^ ^ 6^/3^ for 
1 <^ < flog^. 

Corollary 5.5. Fix 1/2 > e > 0. Suppose we are given a large integer i with fewer than 
(log£)^~'^ prime divisors below log£. ^4/50 we are given exp(3(log£)"'^~'^) < Q < £^ , and 
J a positive integer < exp(v^log£). Select z to be the largest integer such that Y[p<zP — 
q2-o(i)_ j'fiQfi there exists an integer q E [Q^~'^^^°^^"^'^ ,Q], coprime to i whose prime 
factors all lie in [i/i, z] with Xlpig l/p^~^/^"^^ > e^/3^ for 1 < ^ < | log 2, for which 

^((j + l)£/2; q, a) - ^iJi/2; q, a) - i/2^{q) 



> max 

^ («,g)=i 



l/2^{q) 



< Jexp(-/3^ 
V Vlo, 



v/Iog^ 



Vlog log J 



where 13 is a positive absolute constant. 

Proof. If j > e(log2)/10 let Pj =0. If 1 < j < e(log2)/10 let Pj be the set of primes in 
Ij, omitting the prime divisors vi, . . .V£ of Siegel moduli and any prime divisors of i. Now, 



for 1 < J < J replace x by ji/2 and h by £/2 in Proposition 5.3, and sum, which yields 

^ I ^^ — V 1^1 n -h \ \r./-z: n. a\ — nnr.fz: n. a\ — r./-Zinin\ 

<^ Jexp I — a- 



7 --— - > max 

J = l qeQ 



^?((j + l)£/2; q, a) - ^(j£/2; q, a) - £/2(^(q) 



il2^{q) 



v/toP 



Vlogz 



Thus we may choose a g as described in the result, proceeding as in the proof of Corollary 
5.4. 

Corollary 5.6. Given a large integer i, let Q = & for sufficiently small 6 > 0, and 
let J be a positive integer < exp(v^IogT). Select z = 101og£. Then there exists an 
integer q E [Q^~'^/^"^^°^'^,Q], coprime to i whose prime factors all lie in [z/2,z] with 
Ep|g l/pi-^/i°g^ > 6eVlog2 /or 1 < ^ < I log 2, for which 



} max 



mj + l)i/2; Q, g) - ^ji/2; g, a) - i/2ip{q) 
i/2cp{q) 



< Jexp( -P -A; 

V Vloglog^/ 
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where 13 is a positive absolute constant. 

Proof. Let Pj = for j > 1, and let Pi be a set of tti = [26 log £/ log log £] primes in 
/i = (-2/2, z], omitting the prime divisors vi, . . .V(, of Siegel moduli and any prime divisors 
of £; this is possible since /i contains ~ 5 log 1/ log log I primes, and we are forced to omit 
at most ~ log £/ log log £. From here we proceed as in the proof of Corollary 5.5. 

Proof of Theorem 1.5. Take Q = x^ in Corollary 5.4 to obtain q which satisfies the hypoth- 
esis of Corollary 3.5. Let u := logy/ log ^ and select v± as in Corollary 3.5. We consider 
the Maier matrices Ai± with 

r log(rq' + s) if rq + s is prime, 
[ otherwise, 

where x/q < r < 2x/q and v± < s < v± + y. Let M± denote the sum of the entries of 
A4±. Using (5.3) to sum the entries in column s we see that 



Let r+ denote the row in Ai+ whose sum is largest, and let x+ := gr^ + v+ E {x,2x). 
Since there are x/q + 0{1) rows, we have 

^{x+ + y)- i}{x+) >-{! + 0{x-^/^))M+, 

X 

and then Theorem 1.5 follows from the bounds in Corollary 3.5. The analogous argument 
works for Ai-. 

Proof of Theorem 5.1. Take Q = & in Corollary 5.5 to obtain q which satisfies the hy- 
pothesis of Corollary 3.3. Let u := logy/ log ^ and select U in Corollary 3.3 so that 
U{1 - A/logt/) = -u; then put S± = [z^±], so that S± G [^^^ 2^+cViogu]_ ^^ consider 
the Maier matrices A1± with 

r log(rg + s£) if rq + s£ is prime, 
i otherwise, 

where R < r < 2R with R := [£/{2q)] and I < s < S±. Let M± denote the sum of the 
entries of 7Vf±. Using Corollary 5.5 to sum the entries in column s we see that 



2ip{q)\ \ ^\ ^\/loglogxJJJ ^ 



S<S± 

(s,g)=l 



Now, the sum of the entries in row r equals if (r, £) > 1; and equals 'd{£S± + qr; £, qr) — 
'd{qr; £, qr) = 'd{£S± + qr; £, qr) if (r, £) = 1, since qr < £ and qr is not prime. The number 
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of integers re [R,2R] with (rj) = 1 is Rip {i)/i + 0{T{i)) = ip{i)/{2q) +0{t). Therefore, 
denoting by r+ the row for which 'd{x+;£,a+), with x+ = £S+ + qr+ and a+ = gr^, is 
maximized, we obtain 

^{x+; i, a+) > ^(1 + 0(r i/2))M+, 

and then Theorem 5.1 follows from the bounds in Corollary 3.3. The analogous argument 
works for Ai-. 

Proof of Theorem 5.2. We proceed exactly as in the proof of Theorem 5.1 but replacing 
the use of Corollary 3.3 by Corollary 3.4, and the use of Corollary 5.5 by Corollary 5.6. 



6. Further examples 



6a. Reduced residues. 

Let q be square-free. Writing 



we see that this is 
(6.1) 



n<.x d\q n<x 

(n,(j) = l d\n 

n=a (mod £) n=a (mod £) 



if(a,Q,£)>l 

j'^^f^+Oiriq)) if(a,,,£) = l. 



Corollary 6.1. Let q be a large square-free number, which satisfies (1.3), and define a := 
0-Og\ogq)~^^, (log p)/p with 7] = min(l/100, q;/3). Then for rj {log q)'^^'^ > u> b/rf 
there exist intervals I± C [q/4,3q/4] of length at least (logg)" such that 

J2 1> ^|/+|(l+exp(^--(l + 2577)log(2V??' 

16/+ ^ ' 



n 
(n,g) = l 



and 



^(^) I r I ^ „,,„ f ^ 



\^ 1< ^i^|/_|('i_expf--(l + 25?7)log(2w/?73' 
^ a \ \ n 



nel- 
(n,g) = l 

Deduction of Corollary 1.2. This follows immediately upon noting that z < {logq)^ and 
replacing u/r] by u. 

Proof of Corollary 6.1. Take a{n) = 1 ii n < q with (n, g) = 1 and a{n) = otherwise. 
Recall that rj = min(ct/3, 1/100). Since q has at most log ij/ log log q prime factors larger 
than log q we see that 

l^ p - ■ 

p>logq 
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Therefore from our assumption that Xlpio logp/p = a log log g we may conclude that there 
exists (logij)'' < z < (log(j)/3 such that 



(«-2) E I 



p 



> rf. 



Take i. to be the product of the primes in [z^~'^ , z] which divide q, so that £ is a divisor 
of g and £ < 6^*^^"'"°*^^^'' < ^3. Given (logg)''/^ >u> b/rf we obtain by (6.2) and Corollary 
3.2 (we check readily that rj > z~^l^^ using r\ > 20 log log log g/ log log g) that there exist 
points u± G [UjU{l + 22r])] such that, with y± = [-z""*], 



(6.3a) Yl 1 > (l + exp(-Ml + 2577) log (^)))^y+ 



(n,^) = l 



and 



2u 

~~2 



(6.3b) Yl l<(l-e^p(-u{l + 25r]) log ( 

n<.y_ 

{n,£) = l 

Consider now the "Maier matrices" Ai± whose (r, s)-th entry is {R + r)£ + s with 
1 < r < R and 1 < s < y±, and R = [q/{4i)]. As usual we sum a{n) as n ranges over 
the elements of this matrix. Using (6.1), note that the s-th column contributes unless 
(s,£) = 1 in which case it contributes R(/){q/i)/{q/£) + 0(r(g)). Thus the contribution of 
the matrix is 



s<y± 



Corollary 6.1 follows immediately from (6.3 a,b). 

Proof of Corollary 1.1. Let g = HpeP^' ^^^^ thaX A, the set of integers up to g without 
any prime factors from the set V, is a subset of [1, g] of density (j){V)/V = 4>{q)/q, which is 
strictly < 1 by hypothesis. Let i = Y[^<p<z/3 pevP ^° ^'^^^ ^ — g^"'^"*""'-^-* for some fixed 
5 > 0, and apply the argument in our proof of Corollary 6.1 above. In place of Corollary 
3.2 we appeal to Corollary 3.3 (with i in place of g, since the hypothesis on V implies that 
-ff(^) ^ e^/^), and see that for suitably large u < y/z there exist integers y± > z'^ such 
that 

J2 1> (l + exp(-w(logw + loglogw + 0(l)))^y+. 






and 



m 



n<.y— 



1 < f 1 — exp(— w(logtt + log log w + 0(1)) )^—y 
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We conclude that for large u < -J~z there are intervals I± C [q/4, 3(j/4] of length > z^ such 
that 

^ 1 > rM|/^|^ +exp(-w(logw + loglog'U + 0(l))' 



nel+ ^ 

{n,q) = l 



and 



Y^ 1 < ^|/_|('l-exp(-w(logw + loglogw + 0(l))y 



nel- 
{n,q) = l 



Finally, by Proposition 3.8 we find that we can take y± < z"+^, provided that u < 

(1 — e) log log z/ log log log z. 

From the "fundamental lemma" of sieve theory (see [7]), it follows that these estimates 
are essentially optimal. 

Example 6. Take q = Y[p<zP ^^'^ ^ ^° ^^ ^^^ integers below q that are coprime to q. 
We show how to tweak A to obtain a set B C [1,^] such that the symmetric difference 
\{A\B) U {B\A)\ is small, but such that B is well distributed in short intervals. 

Let k = [g/(logg)^] and divide [1, q] into k intervals [mh + 1, (?n + l)h] for 1 < m < k, 
and h = q/k = (logg)^ + 0(1). For 1 < m < k consider whether 

(6.4) Yl l-^h<{logq)^ 

m,^+l<n<(m+l)/i 
(n,q) = l 

holds or does not hold. If (6.4) holds then take i3n[?Ti/i+l, (?n+l)/i] = ^n[m/i+l, (?b+1)/i]. 
Otherwise pick an arbitrary set of \'^{(i)hlq\ numbers in \mh + 1, [m + l)/i] and take that 
to be i3 n {mh + 1, (m + \)h\. 

By construction we see that for any interval [x, a; + y] C [1, g] 



(6.5) Y l = ^y + 0{ 



h + 



x<n<x+y 

Thus B is well distributed in short intervals of length y > (logq)^, say. 

Further note that A and B are quite close to each other. Indeed, from the Theorem in 
Montgomery and Vaughan [13] we know that for integers r > 1 

m<k ■m.h+l<n<{m-'rl)h 
(n,g) = l 

It follows that the number of values m for which (6.4) does not hold is ^j. q/(logq)'^'^. 
Therefore for any r > 1 

(6.7) \{A\B)U{B\A)\'^rq/{logqy. 
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By (6.1) and (6.7) we therefore see that B is also weU distributed in arithmetic progressions 
a (mod £) provided £ <^r 0-OgqY. 

Now take i = Y[^<p<wP = e^^~^"^^'>^^ with w < ^ so that i\q. For u < y/w but large, 
our usual Maier matrix argument then gives that one of the following statements holds: 
(i) There exists y G [q/4:,q] and a (mod £) such that, with 5((a, £) = 1) being 1 or 
depending on whether (a, £) = 1 or not, 

B{y;i,a)-S{{a,i) = l)^^^y > exp(-w(logw + loglogw + 0(l)))^^^y. 

(ii) There exists an interval [x, x + y] C [1, g] with y > w^ such that 

y 1 y > exp(— w(logtt + loglogw + 0(l)))y. 

x<n<x+y 
neB 

But from (6.5) we see that case (ii) cannot hold if w'^ > (logg)^ and if 
^-u{iogu+iogiogu+o{i)) ^ 1/logQ. We conclude therefore that the distribution of B in 
arithmetic progressions is compromised, and that case (i) holds in this situation. In par- 
ticular the expected asymptotic formula for B{y; £, a) is false for some i <^e exp((logq')'^). 

Our argument also places restrictions on the uniformity with which Montgomery and 
Vaughan's estimate (6.6) can hold. Given h and q with h(j){q)/q large, define rj by q/(j){q) = 
{h(t){q) / q)"^ . We now show that if (6.6) holds then r < {\og{h(t){q) / q)f+^'^+°^^\ 

Fix e > 0. Choose L so that h(j){q)/q = L^^*^; then let u — {1 — e) log L/ log log L and 
«; = (logL)3+4'7+4.. 

We follow the same argument as in example 6, but replacing "(logg)"^" in (6.4) by 
'%h(j){q)/q)/U\ from which it follows that we replace "y/logg" in (6.5) by ''y/U\ and 
that the upper bound in (6.7) is <^ qh{2r / eL'^Y . Case (ii) cannot hold in our range by 
(6.5). By the combinatorial sieve we know that \A{y; i, a) — 5((a, i) = l)(j){q/£)y/q\ <^ 2^*^^^ 
so that, since case (i) holds, \{A\B) U {B\A)\ ^ (/}{q)/£L in our range. By (6.7) we deduce 
that e^+°("') = lL{q/(t){q))h > (eL72r)^ so that r < zw/logL = {\ogLf+^'^+^\ which 
implies the result. 

6b. 'Wirsing Sequences'. 

Let P be a set of primes of logarithmic density a for a fixed number a G (0, 1); that is 

logp 



J]^ = (a + o(l))loga:, 

p<x 



p<-x 



as X — !► oo. Let A be the set of integers not divisible by any prime in V and let a{n) = 1 
if n G ^ and a{n) = otherwise. Wirsing proved that (see page 417 of [18]) 

(6.8) Aix) ~ f" X n (l - - 

r(l — a) -'■-'- V p 

p<x 
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Let h be the multiplicative function defined hj h{p) = if p E V and h{p) = 1 if p ^ V 
and take fq{a) = h{{a,q)) and 7^ = Ilpigpepll ~ ^/p)- Naturally we may expect that 

A{x; q, a) ~ '^ ' A{x) and our work places restrictions on this asymptotic. 

Let u > max(e^'", e^*^*^) be fixed. Then for large x we see that the hypotheses of 
Theorem 2.4 are met with z = (logx)/3 and r] = 1/logtt. Combining Theorem 2.4 with 
(6.8) which shows that A{x)/x varies slowly (and therefore equals (1 + o{l))A{y)/y for 
any y G (x/4, x)) we attain the following conclusion. 



Corollary 6.2. For fixed u > max(e 



2/a „100^| 



and large x there exists y G (x/4, x) and an 



arithmetic progression a (mod i) with i < a:(3/logx)'" such that 

fe{a) 



A{y;i,a) 



£^e 



A{y) > exp(— 'u(log'u + ©(loglog-u))) 



Ay) 



Similarly using Theorem 2.5 with rj = 1/logw and z = (^logx)"^' we obtain the fol- 
lowing "uncertainty principle" showing that either the distribution of A in arithmetic 
progressions with small moduli, or the distribution in short intervals must be compro- 
mised. 

Corollary 6.3. Let u > max(e^/", e-*^*^*^) be fixed and write u = MN with both M and N 
at least 1. Then for each large x at least one of the following two statements is true. 

(i) There exists y G (x/4, x) and an arithmetic progression a (mod q) with 
q < exp((logx)"M') such that 



A{y;q,a) 



^^A{y) :^eM-u{logu + 0{loglogu)))^. 



(a) There exists y > (ilogx)^ and an interval (v^v + y) C {x/A^x) such that 



A{v + y)- A{v) 



A{v) 



^ exp{—u{logu + 0(loglogti)))y 



A{v) 



6c. Sums of two squares and generalizations. 

Example 3, revisited. We return to Balog and Wooley's Example 3, the numbers that 

are sums of two squares. It is known that (see Lemma 2.1 of [1]) 



(6.9) 



A{x; q, a) 



^^^"U(x)ri+o^^^°s2^ 



Qlq 



logo; 



Take q to be the product of primes between ^logx and log x/ log log x. Using the Maier 
matrix method, (6.9) and our Corollary 3.3 we obtain that for fixed u and large x there 
exist y± > (logx)" and intervals [v±,v± + y±] C [x/4, x] such that 






a{n) > (1 + exp(— 'u(log'U -|- loglogw + 0(l))))y. 



+ " 



A{x) 



X 
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and 

EA(x) 
a{n) < (1 — ex.p{—u{logu + loglogw + 0(l))))y_ . 

V— <n<D_+3/_ 

These are of essentially the same strength as the results in [1] . 

Further we also obtain that (for fixed u) there exists y G (x/4, x) and an arithmetic 
progression a (mod i) with £ < a:/(logx)" such that 

Aiy;i,a)-^A{y)\>eM-nilogu + loglogu + 0{m^. 

Example 7. Let K he a number field with [K : Q] > 1 and let R be its ring of integers. 
Let Ci, . . . , C/i be the ideal classes of R, and define A^'^^ to be the set of integers which 
are the norms of integral ideals belonging to C^. From the work of R.W.K. Odoni [14] we 
know that 

'^^'^^^^^'^7logx)i-^W 

where c^ > is a constant and E{K) denotes the density of the set of rational primes 
admitting in K at least one prime ideal divisor of residual degree 1 . It is well known that 
E{K) > l/[K : Q] and also we know that E{K) < 1 — 1/[K : Q] (see the charming article 
of J-P. Serre [16]). 

We now describe what the natural associated multiplicative functions h and fq should 
be. Define 6{n) = 1 is n is the norm of some integral ideal in K and 6{n) = otherwise. 
Clearly n E A^^' for some i if and only if 5{n) = 1. Naturally we would expect that 









EtoSip')/p' 



and so the natural definition of h is to take 



p^ J2T=o^(p^)/p^' 

Note that h{p) = 1 if d{p) = 1 (which happens if p has a prime ideal divisor in K of 
residual degree 1, and so occurs for a set of primes with density E{K)) and that h{p) < 
1/p + 0{l/p^) if 5{j>) = (and this happens for a set of primes of density 1 — E{K) > 0). 
With the corresponding definition of fq{a) we may expect that for {q,S) = 1 (for a finite 
set of bad primes S including all prime factors of the discriminant of K) 

A(^\x;q,a)r^^^A^'\x). 

By appealing to standard facts on the zeros of zeta and L-functions over number fields 
one can prove such an asymptotic for small values of q (for example, if q is fixed). Our 



AN UNCERTAINTY PRINCIPLE FOR ARITHMETIC SEQUENCES 35 

work shows that that this asymptotic fails if q is of size x/(\ogx)^ for any fixed u. We 
expect that one can understand the asymptotics of A^'^\x] q, a) for appropriate smaU q in 
order also to conclude that the distribution of A^'^' in short intervals (of length (logx)") 
is compromised. We also expect that similar results hold with R replaced with any order 
inK. 

Example 8. Let /c be a fixed integer, and choose r reduced residue classes ai, . . ., a^ 
(mod k) where 1 < r < (/>(/c). Take A to be the set of integers not divisible by any 
prime = a^ (mod k) and take S to be the set of primes dividing k. Here h is completely 
multiplicative with h{p) = if p = Oj (mod k) for some j and h{p) = 1 otherwise, and 
fq{a) is defined appropriately. This is a special case of a Wirsing sequence, and so (6.8) 
and Corollary 6.2 apply (we see easily that i in Corollary 6.2 may be chosen coprime to 
k). Note also that Example 3 essentially corresponds to the case k = 4 and ai = 3. This 
also covers Example 7 in the case when K is an abelian extension. 

We may apply standard techniques of analytic number theory to study A{x) and 
A{x;qja). Consider the generating function A{s) = Xl^i '^(^)^~* which converges ab- 
solutely in Re(s) > 1 and satisfies the Euler product Hp^ai (mod A:)(-'- — P~^)~^- Further 
using the orthogonality relations of characters t/j (mod k) we see that 



Tp (mod k) 

where B is absolutely convergent in Re(s) > 1/2. Further for a character x (mod q) with 
(g, k) = 1 we get that 

n=l ip (mod k) 



with -B(s, x) absolutely convergent in Re(s) > 1/2. For large x, if g < exp(vloglr) with 
(g, k) = 1 is such that for all characters x (mod qk) (primitive or not) L(s, x) has no zeros 
in (J > 1 — c/ log{qk{l + \t\) then we may conclude by standard arguments that 

(6.10) A(x] q, a) = ^!^^A(x) + 0{x exp(-CJlogx)), 

for some constant 1 > C > 0. Since k is fixed we may suppose that no divisor of it is 
a Siegel modulus. Let z/i, . . . , ft denote the Siegel moduli below eyi\)[^J\ogx) (see §5 for 
details, and note that t -C log log x), and select a prime factor Vi for each z/j. Choose q 
to be the product of primes between ^Jw and w with w = (C/10) Vlogx, taking care to 
omit the primes vi, . . . , vt which fall in this range. Then (6.10) applies to this modulus 
q (which is of size exp((C/10 + o{l))^ylogx) and applying the Maier matrix method (and 
our Corollary 3.3) we deduce that for large u < ^/w there exists an interval [v, v + y] in 
[x/4, 3x/4] with y > (logx)^ such that 

A(x) A(x) 

(6.11) A{v + y) -A{v)-y^^-^ > exp(-2'u(logw + loglogw + 0(l)))y- ^ ^ 



X 



X 

Arguing more carefully, using a zero density estimate as in §5, it may be possible to improve 
the right side of (6.11) to exp(— w(log'U + log log -u + 0{l)))y—^. 



36 ANDREW GRANVILLE AND K. SOUNDARARAJAN 

6d. The multiplicative function z^^'^^ for z E (0,1). 

Take a{n) = z^^^' where 0(?i) denotes the number of prime factors of n counted with 

multiphcity and 2 is a fixed number between and 1. We take 5 = and h{n) = z^^^' 

and fq{a) = 2*^((°^'9)). For large x we know from a result of A. Selberg (see Tenenbaum 

[17]) that 

(logx)^-i 
A{x) ~x — — - — . 

r{z) 

From Theorem 2.4 and the above, we deduce that for fixed u > max(e^' *^^~^-', e^*^*^) 
and large x there exists y G (x/4, x) and an arithmetic progression a (mod i) with 
£ < x{3/ logx)" such that 

A{y;i,a)-^Aiy) » exp(-^log« + 0(loglogtx)))4^. 

Suppose q < exp(y^Ioglr) is such that for every character x (mod q) (primitive or not) 
L(s, x) has no zeros in a > 1 — c/ log{q{\t\ + 2)) for some constant c > 0. Then following 
Selberg's method we may see that for some 1 > C > 

(6.12) A(x; q, a) = ^^A(x) + 0(x exp(-CJh^)). 



Let vi, . . . , Vr he the Siegel moduli below e^^°^^ (see §5 for details; and note that r <^ 
log log x), and select one prime factor Vi for each z/^. Choose q to be the product of primes 
between y/w and w for w = (C/10) y^Iog^, taking care to omit the primes vi, . . . , Vr should 
they happen to lie in this interval. Then (6.12) applies to this modulus q, and using the 
Maier matrix method and appealing to Corollary 3.3 we find that for large u < y^ there 
exists an interval [v, v + y] C [x/4, 3x/4] with y > (logx)'" such that 

A(x) A(x) 

(6.13) A{v + y)- A{v)-y^^-^ > exp(-2w(logw + loglogw + 0(l)))y- ^ ^ 



X 



X 



Taking greater care, using a zero density argument as in §5, it may be possible to improve 
the right side of (6.13) to ^ exp{—u{log u + log log u + 0{l)))yA{x)/x. 

7. An UNCERTAINTY PRINCIPLE FOR INTEGRAL EQUATIONS 

E. Wirsing [18] observed that questions on mean-values of multiplicative functions can 
be reformulated in terms of solutions to a certain integral equation. We formalized this 
connection precisely in our paper [6] and we now recapitulate the salient details. Let 
X '■ (0, 00) — > C be a measurable function with xif) = 1 for < t < 1 and |x(t)| < 1 for all 
t > 1. Let cr(w) = 1 for < tt < 1 and for tt > 1 we define a to be the solution to 

(7.1) ua{u)= xit)(^{u-t)dt. 

Jo 
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In [6] we showed that there is a unique solution (j{u) to (6.1) and that criu) is continuous 
and |a"('u)| < 1 for all u. In fact a(tt) is given by 

(7.2a) a{u) = l + J2—T^Ij(^-^x), 

where 

(7.2b) lAu;x)= [t,,...,t,>i ^—^■■■^—^dt^■■■dt,. 

Jti + ...+tj<u ^l ^3 

The connection between multiplicative functions and the integral equation (7.1) is given 
by the following result which is Proposition 1 in [6]. 

Proposition 7.1. Let f he a multiplicative function with \f{n)\ < 1 for alln, and f{n) = 1 
for n < y. Let '&{x) = J2p<x logP ^'^^ define 

X{u) = Xfiu) = — — J2 f(P) logP- 

Then xif) is a measurable function with |x(t)| < 1 for all t and xif) = 1 for t < 1. Let 
(j{u) be the corresponding unique solution to (7.1). Then 



i5^/(n)=a(z.)+of-^ + i^Y 



n<y 

The converse to Proposition 7.1 also holds (see Proposition 1 (converse) of [6]) so that 
the study of these integral equations is entirely analogous to the study of mean- values of 
multiplicative functions. Translated into this context our oscillation results of §3 take the 
shape of an "uncertainty principle" which we will now describe. 

We define the Laplace transform of a function / : [0, oo) — > C by 



/•OO 

£(/,«)= / f{t)e-^'dt. 
Jo 



If / grows at most sub-exponentially then the Laplace transform is well-defined for complex 
numbers s in the half-plane Re(s) > 0. From equation (7.1) we obtain that for Re(s) > 

(7.3) C{ua{u),s) = C{x,s)C{a,s). 

Moreover from (7.2b) we see that when Re(s) > 

i-x(^) 



(7.4) sC{a,s) = exp(-c(- 



Finally, observe that if f-^ |1 — x(t) |/t dt < oo then from (7.2b) it follows that lim^i^oo o"(w) 
exists and equals 

a^:=e- where ry := H '-^^ dt =/:fi^^,oV 
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Theorem 7.2. Suppose (Joo 7^ is such that \cj{u) — aoo\ < exp(— (tt/A) \ogu) for some 
positive A and all sufficiently large u. Then either xif) = 1 almost everywhere for t > A, 
'^^ /o°° ~i e'^^dt diverges for some C > 0. 

We view this as an "uncertainty principle" since (by choosing A = 1) we have shown 
that both |x(t) — 1| and \a{u) — aoo\ cannot be very smaU except in the case x(t) = (t(w) = 1. 

Proof. Since \(j{u) — o"oo| < exp(— (-u/A) log-u) for all large u (say, for all u>U) it follows 
that C{a — (Too, s) is absolutely convergent for all complex s. Therefore the identity 

s£(a, s) = sC{a - a^o, s) + (Too, 

which a priori holds for Re(s) > 0, furnishes an analytic continuation of sC{a, s) for 
all complex s. Suppose now that J„°° I ~x(. i\ ^ct^^ converges for all positive C. Then 

C{ ~^^" , g) is absolutely convergent for all s G C, and so defines a holomorphic function 
on C. Hence the identity (7.4) now holds for all s G C. 
If Re(s) = -i then 



/•oo 
|s£((T, s)| < 1 + |s| / |(T(ti) — (Toole^^ii-u 

f LJ (* CXD 1 

<l + \s\( 2e«"dw+/ exp(uU-^^))du 
<l + \s\ (2(e^« - 1)/C + exp (^A{C + 1) + e^^'V^) + 1 



=^(«+i) 



where we bounded the second integral by the sum of the two integrals f^ + J^au+i) 

with the same integrand. In the range of the first integral one uses u{^ — {logu)/A) < 
e^^~^ /A, and in the range of the second integral one uses u{^ — {logu)/A) < —u. Therefore, 
by (7.4), if Re(s) > — ^ and Im(s) <^ e^ with ^ large, then 



We now apply the Borel-Caratheodory lemma^ to — £( — ^^^, s) taking the circles with 
center 1 and radii r — $, + 1 and R — $, + 2. Since 



^ 



1^^,11 



< I -^< 1/2, 
1 V 



^This says that for any holomorphic function / we have 



max \f{z)\ < max Re/(2:) + 1/(^0)1 

\z-zo\=r H — r\z-ZQ\=R H — r 



where < r < R. 
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^,i-x(^) 



ri^^,-^) < max £(1^2^,.) «(e + l)e^^. 

\ V J |s-l|=f+l \ V / 



we deduce from the last two displayed estimates that 

k-i|=C+i 
On the other hand, for any 5 > we have 

SO that 

Taking 5 = 21og^/^ and letting ^ ^ oo, we deduce that /^ ex(-") ^^ ^ q. ^j^^^^ jg^ 
x(t') = 1 almost everywhere for v > A. This proves the Theorem. 
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